Method for Producing a Polynucleotide Library 

Field of Invention 

A method for making a plurality of new recombined
polynucleotides by using a mismatch repair protein(s) or
enzyme(s) to recombine at least two variants of the same
polynucleotide or at least two homologous polynucleotides by
Recombinatorial Chain Reaction (RCR).

Background of the Invention

The mismatch repair system is a system within cells which
recognizes strand-strand nucleotide mismatches in newly
synthesized duplex DNA sequences by comparing the new
polynucleotide strand with the "old" polynucleotide strand
originating from the parental duplex DNA, especially following
DNA replication. The mismatch repair system of e.g. Escherichia
coli corrects the strand-strand nucleotide mismatches by using
the methylated "old" strand of the new duplex DNA as a
template.

Independently of the molecular mechanism, the mismatch
repair system normally limits the genetic diversity within a
cell, where diversity in this context means the number of
different DNA sequences. For example, a heteroduplex
polynucleotide which comprises a single mismatch represents a
diversity of two, since after one round of replication, the
heteroduplex with the mismatch will have become two different
double-stranded homoduplexes (with a one base pair difference
between the two, originating from the mismatch in the parental
heteroduplex).

However, if the mismatch repair system corrects the
mismatch in a heteroduplex before replication, the result will
be two identical homoduplex DNA sequences; consequently, the
genetic diversity is reduced to only one.

Several strategies and methods for generating genetic
diversity are known in the art, such as classical random
mutagenesis, site-directed mutagenesis, gene shuffling etc.
However, there is still a need for new methods and ways to
produce diverse polynucleotide sequences that may encode
polypeptides with new properties or may have new properties
themselves.

The state-of-the-art shuffling methods are very efficient
in shuffling polynucleotides comprising mutations that are
located far apart in the polynucleotide sequences. However,
shuffling or recombining mutations that are positioned in
relatively close vicinity within a polynucleotide molecule has
remained a challenge so far.

The present invention provides a method of utilizing a
mismatch repair protein(s) or enzyme(s) to increase the genetic
diversity in a polynucleotide population from a starting
material of at least two homologous polynucleotides, to obtain
a plurality of new recombined homologous polynucleotides. The
method of the invention even allows for the shuffling or
recombining of homologous polynucleotide sequences, where the
sequence variation(s) between the at least two parental
starting sequences are closely located in the polynucleotide
sequence.

The method of the invention utilizes a mismatch repair
protein(s) as known in the art (Biswas and Hsieh, 1996,
Identification and Characterization of a Thermostable MutS
Homologue from Thermus aquaticus, J Biol Chem 271(9):5040-5048)
and (Sugahara et al., 2000, Crystal structure of a repair
enzyme of oxidatively damaged DNA, MutM (Fpg), from an extreme
thermophile, Thermus thermophilus HB8, EMBO J 19(15):3857-
3869).

Summary of the Invention

The problem to be solved by the present invention is how
to generate diverse polynucleotide libraries that comprise new
recombined polynucleotides, from a starting material comprising
homologous template polynucleotides. A cell population
comprising such a library may then be used to screen for a
particular property/activity of interest encoded by a
polynucleotide which can be selected on this basis. Also,
polynucleotide sequences with particular changed or improved
properties might be selected, such as promoters, terminators
and other regulatory elements.
The present inventor provides a method for increasing the
genetic diversity from a starting material of at least two
homologous double-stranded polynucleotides or at least two
variants of the same double-stranded polynucleotide, such as
two different DNA sequences encoding homologous polypeptides,
e.g. enzymes or pharmaceutically active peptides.

As mentioned above, the present invention even allows
shuffling or recombining of homologous polynucleotide
sequences, where the sequence variation(s) between the at least
two parental starting sequences are closely located in the
polynucleotide sequence, e.g. the two starting sequences may
comprise variations that are only one or a few nucleotides away
from each other.

Optionally, steps (b) through (d) of the method of the
present invention may be repeated for one or more cycles,
wherein the new duplexes of step (d) serve as new template
polynucleotides in step (b) in each subsequent cycle.
Increasing the number of repeats or cycles will result in an
increase in the number of new recombined polynucleotides, as
new permutations of mismatches will be generated in the
annealing step of each cycle.

Accordingly, in a first aspect the present invention
relates to a method for forming a plurality of recombined
homologous double-stranded polynucleotides from at least two
homologous double-stranded template polynucleotides, said
method comprising the steps of:

a) providing a solution comprising at least two non-methylated
homologous double-stranded template polynucleotides and one or
more mismatch repair protein(s);

b) denaturing the template polynucleotides into single-stranded
polynucleotides;

c) annealing the different single-stranded polynucleotides,
wherein heteroduplexes are formed;

d) allowing the mismatch repair protein(s) to repair
nucleotide mismatches in the heteroduplexes, wherein
recombined new duplexes are formed; and

e) optionally, repeating steps b) through d) for one or more
cycles; wherein the new duplexes of step d) serve as new
template polynucleotides in step b) in each subsequent
cycle.
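
Purely by way of illustration, steps b) through e) can be
sketched as a small in-silico simulation. The sketch below is not
part of the method as such and rests on several assumptions: all
templates have the same length, every strand is written in the
sense orientation so that a mismatch is simply a position where
the two annealed strands differ, each mismatch is resolved
independently toward one of the two strands with equal
probability, and strands re-anneal with randomly chosen partners.
The function names repair and rcr are hypothetical.

```python
import random


def repair(top, bottom, rng):
    """Step d): resolve each mismatch toward a randomly chosen strand.

    Assumption: with non-methylated strands there is no template strand,
    so either base of a mismatched pair may be kept.
    """
    return "".join(a if a == b else rng.choice((a, b))
                   for a, b in zip(top, bottom))


def rcr(templates, cycles, rng):
    """Run steps b) through d) for the requested number of cycles."""
    pool = list(templates)
    for _ in range(cycles):
        # Step b): each duplex melts into a sense and an antisense strand,
        # both represented here by the sense-orientation sequence.
        sense = list(pool)
        antisense = list(pool)
        # Step c): strands re-anneal with random homologous partners.
        rng.shuffle(antisense)
        # Step d): mismatches are repaired, giving new recombined duplexes
        # that serve as templates for the next cycle (step e).
        pool = [repair(s, a, rng) for s, a in zip(sense, antisense)]
    return pool


if __name__ == "__main__":
    rng = random.Random(0)
    parents = ["AAAAAA"] * 50 + ["CCCCCC"] * 50
    for cycles in (1, 3, 10):
        recombined = rcr(parents, cycles, rng)
        print(cycles, "cycle(s):", len(set(recombined)), "distinct sequences")
```

Under these assumptions the number of distinct sequences in the
pool tends to grow with the number of cycles, which is the
behaviour the optional step e) relies on.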

In a second aspect the present invention relates to a
plurality of recombined polynucleotides generated by a method as
defined in the first aspect.

A library of recombined polynucleotides generated by the
method of the invention may be screened for a particular
activity or property of interest, and a polynucleotide may be
selected based on the results of such a screening.

Accordingly, in a third aspect the invention relates to a
recombined polynucleotide generated by a method as defined in
the first aspect.

Also, in a fourth aspect the invention relates to the use
of a plurality of recombined polynucleotides of the second
aspect generated by a method as defined in the first aspect, in
a screening assay for an activity or property of interest.

In a final aspect the invention relates to the use of a
recombined polynucleotide of the third aspect generated by a
method as defined in the first aspect, for expression or
production of a polypeptide of interest.



Definitions 

The following section provides definitions of technical
features in the above-mentioned aspects of the invention.

The term "a gene" denotes herein a gene (a
polynucleotide) which is capable of being expressed into a
polypeptide within a living cell or by an appropriate
expression system. Accordingly, said gene is defined as an open
reading frame starting from a start codon (normally "ATG",
"GTG", or "TTG") and ending at a stop codon (normally "TAA",
"TAG" or "TGA"). In order to express said gene there must be
elements, as known in the art, in connection with the gene,
necessary for expression of the gene within the cell. Such
standard elements may include a promoter, a ribosomal binding
site, a termination sequence, and possibly other elements as
known in the art.
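
For illustration only, the open reading frame criterion in the
definition above can be checked as follows. The sketch assumes a
DNA sequence written 5' to 3' in the coding orientation; the
additional requirements that the length be a multiple of three
and that no stop codon occurs in frame before the last codon are
assumptions made for the example rather than statements taken
from the definition. The function name is hypothetical.

```python
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def is_open_reading_frame(seq):
    """Return True if seq starts with a start codon, ends with a stop
    codon, and has no in-frame stop codon in between (assumed rule)."""
    seq = seq.upper()
    if len(seq) < 6 or len(seq) % 3 != 0:
        return False
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    return (codons[0] in START_CODONS
            and codons[-1] in STOP_CODONS
            and not any(c in STOP_CODONS for c in codons[1:-1]))
```

A sequence such as ATGGCTTAA satisfies the check, whereas a
sequence with an internal in-frame stop codon does not.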

The term "substantially pure polynucleotide" as used
herein refers to a polynucleotide preparation, wherein the
polynucleotide has been removed from its natural genetic
milieu, and is thus free of other extraneous or unwanted coding
sequences and is in a form suitable for use within genetically
engineered protein production systems.

Thus, a substantially pure polynucleotide contains at the
most 10% by weight of other polynucleotide material with which
it is natively associated (lower percentages of other
polynucleotide material are preferred, e.g. at the most 8% by
weight, at the most 6% by weight, at the most 5% by weight, at
the most 4% by weight, at the most 3% by weight, at the most 2%
by weight, at the most 1% by weight, and at the most ½% by
weight). A substantially pure polynucleotide may, however,
include naturally occurring 5' and 3' untranslated regions, such
as promoters and terminators.

It is preferred that the substantially pure polynucleotide
is at least 92% pure, i.e. that the polynucleotide constitutes
at least 92% by weight of the total polynucleotide material
present in the preparation, and higher percentages are
preferred such as at least 94% pure, at least 95% pure, at
least 96% pure, at least 97% pure, at least 98% pure, at least
99% pure, and at the most 99.5% pure.

The polynucleotides disclosed herein are preferably in a
substantially pure form. In particular, it is preferred that
the polynucleotides disclosed herein are in "essentially pure
form", i.e. that the polynucleotide preparation is essentially
free of other polynucleotide material with which it is natively
associated. Herein, the term "substantially pure
polynucleotide" is synonymous with the terms "isolated
polynucleotide" and "polynucleotide in isolated form".

The term "homologous" in the present context means that
the two homologous polynucleotides or polypeptides have a
"degree of identity" of at least 60%, more preferably at least
70%, even more preferably at least 85%, still more preferably
at least 90%, more preferably at least 95%, and most preferably
at least 98%. Whether two polynucleotide or polypeptide
sequences have a sufficiently high degree of identity to be
homologous as defined herein can suitably be investigated by
aligning the two sequences using a computer program known in
the art, such as "GAP" provided in the GCG program package
(Program Manual for the Wisconsin Package, Version 8, August
1994, Genetics Computer Group, 575 Science Drive, Madison,
Wisconsin, USA 53711) (Needleman, S.B. and Wunsch, C.D.,
(1970), Journal of Molecular Biology, 48, 443-453). GAP is used
with the following settings for DNA sequence comparison: GAP
creation penalty of 5.0 and GAP extension penalty of 0.3.
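
As a non-limiting sketch, the degree of identity of two sequences
that have already been aligned (for example by a global alignment
program) can be computed as shown below. The sketch does not
perform the alignment itself and does not reproduce the GAP
program; in particular, counting identity over the aligned
positions that contain no gap character is an assumption about
the denominator, and the function names are hypothetical.

```python
def degree_of_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over aligned, ungapped positions (assumed rule)."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a.upper(), aligned_b.upper()):
        if x == gap or y == gap:
            continue  # positions opposite a gap are not compared
        compared += 1
        if x == y:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped aligned positions to compare")
    return 100.0 * matches / compared


def is_homologous(aligned_a, aligned_b, threshold=60.0):
    """Apply the 'at least 60%' criterion; stricter tiers use 70 to 98."""
    return degree_of_identity(aligned_a, aligned_b) >= threshold
```

For the aligned pair ACGT-A and ACCTGA, five positions are
compared and four are identical, giving a degree of identity of
80%.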

A "heteroduplex" is used herein as having the meaning
known in the art, which means that a heteroduplex is a
double-stranded polynucleotide, such as a double-stranded DNA
molecule, wherein several base- or nucleotide-pairs are
mismatched or, in other words, the two strands are not
perfectly complementary.

The term "homoduplex" has the well-described meaning known
in the art: a double-stranded polynucleotide wherein the two
strands are perfectly complementary and no nucleotide-pair
mismatches are found, i.e. all adenosines pair with a thymidine
(A's pair with T's) and all guanosines pair with a cytidine
(G's pair with C's).

The term "duplex" as used herein is defined as a
double-stranded polynucleotide which may be either a hetero- or a
homoduplex polynucleotide as defined above.

The term "denaturing" is used herein as known in the art,
for example a double-stranded polynucleotide comprised in a
liquid solution may be denatured by heating the solution to at
least the melting point or melting temperature of the
double-stranded polynucleotide and keeping the solution at that
temperature until the double-stranded polynucleotide has
denatured, separated, or "melted" into two complementary
single-stranded polynucleotides.

"Annealing" as used herein means that conditions such as
temperature and salt concentrations in a liquid solution are such
that a single-stranded polynucleotide comprised in the solution
will anneal preferentially to another single-stranded
homologous polynucleotide comprised in the solution; in other
words, polynucleotides that are not homologous will not anneal
to any significant extent.

"Nucleic acid construct": when used herein, the term
nucleic acid construct means a nucleic acid molecule, either
single- or double-stranded, which is isolated from a naturally
occurring source or which has been modified to contain segments
of nucleic acids in a manner that would not otherwise exist in
nature. The term nucleic acid construct is synonymous with the
term "expression cassette" when the nucleic acid construct
contains the control sequences required for expression of a
coding sequence of the present invention.

"Control sequence" is defined herein to comprise all
components that are necessary or advantageous for the
expression of a polynucleotide of the present invention. Each
control sequence may be native or foreign to the nucleotide
sequence encoding the polypeptide. Such control sequences
include, but are not limited to, a leader, polyadenylation
sequence, propeptide sequence, promoter, signal peptide
sequence, and transcription terminator. At a minimum, the
control sequences include a promoter, and transcriptional and
translational stop signals. The control sequences may be
provided with linkers for the purpose of introducing specific
restriction sites facilitating ligation of the control
sequences with the coding region of the nucleotide sequence
encoding a polypeptide.

"Operably linked" is defined herein as a configuration in
which a control sequence is appropriately placed at a position
relative to the coding sequence of the polynucleotide sequence
such that the control sequence directs the expression of the
polynucleotide.

"Coding sequence" is intended to cover a polynucleotide
sequence which directly specifies the amino acid sequence of
its protein product. The boundaries of the coding sequence are
generally determined by an open reading frame, which usually
begins with the ATG start codon. The coding sequence typically
includes DNA, cDNA, and recombinant nucleotide sequences.

In the present context, the term "expression" includes
any step involved in the production of a polypeptide including,
but not limited to, transcription, post-transcriptional
modification, translation, post-translational modification, and
secretion.

In the present context, the term "expression vector"
covers a polynucleotide molecule, linear or circular, that
comprises a polynucleotide segment encoding a polypeptide of
interest, and which is operably linked to additional segments
that provide for the expression.

The term "host cell", as used herein, is defined below.

The term "mismatch repair system" shall herein be
understood according to the art, as a system normally present
within cells which recognises mismatches in duplex DNA
sequences; see e.g. WO 97/37011, page 1, lines 21-28. The
mismatch repair system either corrects the mismatches by e.g.
using the methylated "old" strand as template or alternatively
the system may mediate degradation of the duplex DNA sequences
which comprise the mismatches. Independently of the precise
molecular mechanism, the end result will be that the "mismatch
repair system" normally limits the "diversity" within the cell,
represented by those duplex DNA sequences that comprise
mismatches. The instant invention, however, utilizes the very
base pair mismatch-correcting property of the mismatch repair
system to generate diversity instead of limiting it. When
non-methylated double-stranded polynucleotides comprising
mismatches are treated with a mismatch repair protein(s), the
result will be unpredictable error-corrections in both strands,
as there is no discernible template strand for the protein(s)
to use for proofreading. This means that various new
nucleotides may be introduced in either polynucleotide strand
of the heteroduplex in the process of forming a new recombined
duplex with a reduced number of mismatches or no mismatches at
all. The mismatch repair system preferably comprises a MutS
homologue, preferably MutS YT1 of Thermus aquaticus, or
comprises a MutL homologue, a MSH2 homologue, a MSH6 homologue,
a MutM homologue, a MutY homologue, a MutT homologue, a MutH
homologue, a HexA homologue, a HexB homologue, or a GTBP/p160
homologue (Biswas and Hsieh, 1996, vide supra).
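
To illustrate why template-free repair can generate rather than
limit diversity, the following sketch enumerates the fully
repaired duplexes that could arise from a single heteroduplex.
It assumes, as a simplification, that both strands are written in
the sense orientation, that each mismatch is resolved
independently of its neighbours, and that the repaired base is
always copied from one of the two strands; the function name is
hypothetical.

```python
from itertools import product


def repair_outcomes(top, bottom):
    """Return every homoduplex sequence reachable by resolving each
    mismatch toward either strand (assumed independent per position)."""
    if len(top) != len(bottom):
        raise ValueError("strands must have equal length")
    choices = [(a,) if a == b else (a, b) for a, b in zip(top, bottom)]
    return {"".join(bases) for bases in product(*choices)}


if __name__ == "__main__":
    # Two parents differing at two positions only three nucleotides apart.
    for seq in sorted(repair_outcomes("ACGTAC", "ATGTCC")):
        print(seq)
```

Under these assumptions a heteroduplex with k mismatches can give
up to 2^k different repaired sequences, of which two are the
parental sequences and the remainder are recombinants, even when
the mismatches lie close together.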

The term "solution" denotes any liquid solution, such as
an aqueous solution, comprising the at least two homologous
double-stranded template polynucleotides and one or more
mismatch repair protein(s).

The term "DNA library", "polynucleotide library", or
"plurality of polynucleotides" denotes herein a library of at
least two different DNA sequences. For many practical purposes
the library is much bigger. Accordingly, the DNA library
preferably comprises at least 1000 different DNA sequences,
more preferably at least 10000 different DNA sequences, and
even more preferably at least 100000 different DNA sequences.

In the present context, the term "allelic variant"
denotes any of two or more alternative forms of a gene
occupying the same chromosomal locus. Allelic variation arises
naturally through mutation, and may result in polymorphism
within populations. Gene mutations can be silent (no change in
the encoded polypeptide) or may encode polypeptides having
altered amino acid sequences. An allelic variant of a
polypeptide is a polypeptide encoded by an allelic variant of a
gene.

The term "thermostable" protein(s) in the present context
means that the protein(s) remains essentially functional after
having been exposed to the relatively high temperatures needed
to denature the double-stranded polynucleotides in step (b) of
the method of the invention. Specifically, the thermostable
protein(s) retains at least 60% to 80% of its activity at
its optimum temperature after one denaturing step, wherein the
activity may be determined by the ATP-hydrolysis (ATPase) assay
described in Biswas and Hsieh, 1996, vide supra, which is
incorporated herein by reference.

Detailed Description of the Invention

A method for forming a plurality of recombined homologous
double-stranded polynucleotides from at least two homologous
double-stranded template polynucleotides according to the first
aspect of the invention.

The techniques used to isolate or clone a polynucleotide
sequence are known in the art and include isolation from
genomic DNA, preparation from cDNA, or a combination thereof.
The cloning of the polynucleotide sequences of the present
invention from such genomic DNA can be effected, e.g., by using
the well known polymerase chain reaction (PCR), expression
cloning, or antibody screening of expression libraries to
detect cloned DNA fragments with shared structural features.
See, e.g., Innis et al., 1990, PCR: A Guide to Methods and
Applications, Academic Press, New York. Other amplification
procedures such as ligase chain reaction (LCR), ligated
activated transcription (LAT) and nucleotide sequence-based
amplification (NASBA) may be used. The nucleotide sequence may
be cloned from a bacterial or fungal strain or another or
related organism and thus, for example, may be an allelic or
species variant of the polypeptide encoding region of the
nucleotide sequence.

The polynucleotide sequence may be obtained by standard
cloning procedures used in genetic engineering to relocate the
polynucleotide sequence from its natural location to a
different site where it will be reproduced. The cloning
procedures may involve excision and isolation of a desired
polynucleotide fragment comprising the polynucleotide sequence
of interest, insertion of the fragment into a vector molecule,
and incorporation of the resulting recombinant vector into a
host cell where multiple copies or clones of the polynucleotide
sequence will be replicated. The polynucleotide sequence may
be of genomic, cDNA, RNA, semisynthetic, synthetic origin, or
any combinations thereof.

Accordingly a preferred embodiment of the invention
relates to a method of the first aspect, wherein the at least
two homologous double-stranded template polynucleotides are
obtained by PCR amplification.

There is a substantial commercial interest in
polypeptides such as pharmaceutically active peptides or
industrial enzymes, and there is much research focused on
changing or improving the properties or activities of such
polypeptides. Terms like "protein engineering" or "gene
shuffling" are frequently encountered in the art. The present
invention provides a new way of recombining polynucleotide
sequences without having to fragment the template
polynucleotides or synthesize a large number of overlapping
primers to be used in a PCR reaction etc.

The method of the invention allows specific
non-determined sequence variations to be recombined between
homologous annealed polynucleotides via the action of a
mismatch repair protein(s) which exchanges nucleotides in the
polynucleotide sequences where there is a mismatch between the
homologous strands, to provide new recombined polynucleotide
strands, thus increasing the genetic diversity. A requirement
for the method to function is that at least two homologous
polynucleotide strands are able to anneal under the conditions
given, and this ability will largely depend on the degree of
identity between the two polynucleotide strands, which should
preferably be at least 60%, more preferably at least 70%, even
more preferably at least 85%, still more preferably at least
90%, more preferably at least 95%, and most preferably at least
98%.

It is well known in the art that polynucleotide sequences
encoding certain polypeptides with similar properties or
activities, such as enzymes, are often highly homologous. The
homologous polynucleotides and polypeptides may be species
variants or allelic variants descending from a common ancestral
sequence which have evolved separately to the present day.

A template polynucleotide may encode an enzymatic
polypeptide, e.g. an aminopeptidase, an amylase, a carbohydrase,
a carboxypeptidase, a catalase, a cellulase, a chitinase, a
cutinase, a cyclodextrin glycosyltransferase, a
deoxyribonuclease, an esterase, an alpha-galactosidase, a
beta-galactosidase, a glucoamylase, an alpha-glucosidase, a
beta-glucosidase, a haloperoxidase, an invertase, a laccase, a
lipase, a mannosidase, an oxidase, a pectinolytic enzyme, a
peroxidase, a phytase, a polyphenoloxidase, a proteolytic
enzyme, a ribonuclease, or a xylanase.

Consequently, a preferred embodiment of the invention
relates to the method of the first aspect, wherein the at least
two homologous double-stranded template polynucleotides encode
homologous polypeptides, preferably having a degree of identity
of at least 60%, more preferably at least 70%, even more
preferably at least 85%, still more preferably at least 90%,
more preferably at least 95%, and most preferably at least 98%.

Another preferred embodiment of the invention relates to
a method of the first aspect, wherein the at least two
homologous double-stranded template polynucleotides encode
homologous enzymes, preferably amylases, proteases, cellulases,
lipases, xylanases, or phospholipases.

The homologous template polynucleotides may be comprised
in a population of host cells which do not methylate
polynucleotides, or the gene encoding the mismatch repair
protein(s) may be comprised in the same population of cells or
in another cell population so that the cells produce the repair
protein(s). The cells may secrete the repair protein(s) or the
cells may produce the repair protein(s) intracellularly; in the
latter case it may be an advantage to lyse the cells prior to
step (b) of the method of the invention.

Accordingly a preferred embodiment relates to a method of
the first aspect, wherein the solution comprises a population
of cells or a lysate of a population of cells.

Further, a preferred embodiment relates to a method of
the first aspect, wherein the population of cells or the lysate
of a population of cells comprises the at least two homologous
double-stranded template polynucleotides.

Still another preferred embodiment relates to a method of
the first aspect, wherein the population of cells or the lysate
of a population of cells comprises the mismatch repair
protein(s).

Yet another preferred embodiment relates to a method of
the first aspect, wherein the population of cells, or the
population of cells giving rise to the lysate, do not methylate
newly synthesized polynucleotides.

As mentioned previously, the denaturing and annealing
steps in the method of the invention can be achieved by raising
and subsequently lowering the temperature of the solution;
however, that would require the mismatch repair protein(s) to
remain essentially functional after having been exposed to the
relatively high temperatures needed to denature the
double-stranded polynucleotides. It may therefore be advantageous
to use a thermostable mismatch repair protein(s).

Accordingly a preferred embodiment relates to a method of
the first aspect, wherein the mismatch repair protein(s) is
(are) thermostable, preferably the thermostable mismatch repair
protein(s) comprises a MutS homologue, preferably MutS YT1 of
Thermus aquaticus, and more preferably the thermostable
mismatch repair protein(s) comprises a MutL homologue, a MSH2
homologue, a MSH6 homologue, a MutM homologue, a MutY
homologue, a MutT homologue, a MutH homologue, a HexA
homologue, a HexB homologue, or a GTBP/p160 homologue.



10070. 010-DK 





As mentioned previously the denaturing step in the method 
of the invention may be achieved by increasing the temperature 
of the solution. 

Accordingly a preferred embodiment relates to a method of
the first aspect, wherein the denaturing is achieved by
increasing the temperature of the solution, preferably to at
least 90°C, more preferably to at least 91°C, more preferably to
at least 92°C, even more preferably to at least 93°C, still more
preferably to at least 94°C, more preferably to at least 95°C,
and most preferably to at least 96°C.

As also mentioned above, the annealing step in the method
of the invention may be performed by lowering the temperature
of the solution, preferably by lowering the temperature at
least to a temperature where the complementary homologous
single-stranded polynucleotides preferentially anneal to each
other and where the mismatch repair protein(s) functions.

Accordingly a preferred embodiment relates to a method of
the first aspect, wherein the annealing is achieved by lowering
the temperature of the solution, preferably at least to a
temperature at which the mismatch repair protein(s) functions,
more preferably at least to between 45°C and 85°C, more
preferably at least to between 50°C and 80°C, more preferably at
least to between 55°C and 75°C, and most preferably at least to
between 60°C and 70°C.
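
By way of example only, the preferred temperature windows recited
in the two preceding embodiments can be encoded as data and used
to check a planned thermal profile. The sketch assumes that the
lowest denaturing threshold is 90°C and treats the range limits
as inclusive; hold times are not specified in the text and are
therefore not modelled. The constant and function names are
hypothetical.

```python
# Denaturing thresholds in degrees C, from "preferably" to "most preferably".
DENATURING_MIN_C = (90, 91, 92, 93, 94, 95, 96)

# Annealing ranges in degrees C, from widest to narrowest preferred range.
ANNEALING_RANGES_C = ((45, 85), (50, 80), (55, 75), (60, 70))


def denaturing_tier(temp_c):
    """Number of preferred denaturing thresholds met (0 means none)."""
    return sum(temp_c >= t for t in DENATURING_MIN_C)


def annealing_tier(temp_c):
    """Number of preferred annealing ranges containing temp_c (0 = none)."""
    return sum(low <= temp_c <= high for low, high in ANNEALING_RANGES_C)
```

For instance, a profile that denatures at 94°C and anneals at
65°C meets five of the seven denaturing thresholds and falls
within all four annealing ranges.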

As previously mentioned, steps (b) through (d) of
the method of the present invention may optionally be repeated
for one or more cycles, wherein the new duplexes of step (d)
serve as new template polynucleotides in step (b) in each
subsequent cycle. Increasing the number of repeats or cycles
will result in an increase in the number of new recombined
polynucleotides, as new permutations of mismatches will be
generated in the annealing step of each cycle.

Consequently a preferred embodiment relates to a method
of the first aspect, wherein steps b) through d) are repeated
for between 1 and 10 cycles; wherein the new duplexes of step
d) serve as new template polynucleotides in step b) in each
subsequent cycle.

Another preferred embodiment relates to a method of the
first aspect, wherein steps b) through d) are repeated for at
least 10 cycles; wherein the new duplexes of step d) serve as
new template polynucleotides in step b) in each subsequent
cycle.

A polynucleotide library obtained by the method of the
invention may be expressed and assayed in a screen for a
particular property/activity of interest encoded by a
polynucleotide which can be selected on this basis. Also,
polynucleotide sequences with particular changed or improved
properties might be selected, such as promoters, terminators
and other regulatory elements.

The present invention also relates to nucleic acid
constructs comprising a nucleotide sequence of the present
invention operably linked to one or more control sequences that
direct the expression of the coding sequence in a suitable host
cell under conditions compatible with the control sequences.

A polynucleotide sequence of the present invention may be
manipulated in a variety of ways to provide e.g. for expression
of an encoded polypeptide. Manipulation of the nucleotide
sequence prior to its insertion into a vector may be desirable
or necessary depending on the expression vector. The
techniques for modifying nucleotide sequences utilizing
recombinant DNA methods are well known in the art.

The control sequence may be an appropriate promoter
sequence, a nucleotide sequence which is recognized by a host
cell for expression of the nucleotide sequence. The promoter
sequence contains transcriptional control sequences, which
mediate the expression of the polypeptide. The promoter may be
any nucleotide sequence which shows transcriptional activity in
the host cell of choice including mutant, truncated, and hybrid
promoters, and may be obtained from genes encoding
extracellular or intracellular polypeptides either homologous
or heterologous to the host cell.

Examples of suitable promoters for directing the
transcription of the nucleic acid constructs of the present
invention, especially in a bacterial host cell, are the
promoters obtained from the E. coli lac operon, Streptomyces
coelicolor agarase gene (dagA), Bacillus subtilis levansucrase
gene (sacB), Bacillus licheniformis alpha-amylase gene (amyL),
Bacillus stearothermophilus maltogenic amylase gene (amyM),
Bacillus amyloliquefaciens alpha-amylase gene (amyQ), Bacillus
licheniformis penicillinase gene (penP), Bacillus subtilis xylA
and xylB genes, and prokaryotic beta-lactamase gene
(Villa-Kamaroff et al., 1978, Proceedings of the National Academy
of Sciences USA 75: 3727-3731), as well as the tac promoter
(DeBoer et al., 1983, Proceedings of the National Academy of
Sciences USA 80: 21-25). Further promoters are described in
"Useful proteins from recombinant bacteria" in Scientific
American, 1980, 242: 74-94; and in Sambrook et al., 1989,
supra.

Examples of suitable promoters for directing the
transcription of the nucleic acid constructs of the present
invention in a filamentous fungal host cell are promoters
obtained from the genes for Aspergillus oryzae TAKA amylase,
Rhizomucor miehei aspartic proteinase, Aspergillus niger
neutral alpha-amylase, Aspergillus niger acid stable
alpha-amylase, Aspergillus niger or Aspergillus awamori glucoamylase
(glaA), Rhizomucor miehei lipase, Aspergillus oryzae alkaline
protease, Aspergillus oryzae triose phosphate isomerase,
Aspergillus nidulans acetamidase, and Fusarium oxysporum
trypsin-like protease (WO 96/00787), as well as the NA2-tpi
promoter (a hybrid of the promoters from the genes for
Aspergillus niger neutral alpha-amylase and Aspergillus oryzae
triose phosphate isomerase), and mutant, truncated, and hybrid
promoters thereof.

In a yeast host, useful promoters are obtained from the
genes for Saccharomyces cerevisiae enolase (ENO-1),
Saccharomyces cerevisiae galactokinase (GAL1), Saccharomyces
cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate
dehydrogenase (ADH2/GAP), and Saccharomyces cerevisiae
3-phosphoglycerate kinase. Other useful promoters for yeast host
cells are described by Romanos et al., 1992, Yeast 8: 423-488.

The control sequence may also be a suitable transcription
terminator sequence, a sequence recognized by a host cell to
terminate transcription. The terminator sequence is operably
linked to the 3' terminus of the nucleotide sequence encoding
the polypeptide. Any terminator which is functional in the
host cell of choice may be used in the present invention.

Preferred terminators for filamentous fungal host cells
are obtained from the genes for Aspergillus oryzae TAKA
amylase, Aspergillus niger glucoamylase, Aspergillus nidulans
anthranilate synthase, Aspergillus niger alpha-glucosidase, and
Fusarium oxysporum trypsin-like protease.

Preferred terminators for yeast host cells are obtained
from the genes for Saccharomyces cerevisiae enolase,
Saccharomyces cerevisiae cytochrome C (CYC1), and Saccharomyces
cerevisiae glyceraldehyde-3-phosphate dehydrogenase. Other
useful terminators for yeast host cells are described by
Romanos et al., 1992, supra.

The control sequence may also be a suitable leader
sequence, a nontranslated region of an mRNA which is important
for translation by the host cell. The leader sequence is
operably linked to the 5' terminus of the nucleotide sequence
encoding the polypeptide. Any leader sequence that is
functional in the host cell of choice may be used in the
present invention.

Preferred leaders for filamentous fungal host cells are
obtained from the genes for Aspergillus oryzae TAKA amylase and
Aspergillus nidulans triose phosphate isomerase.

Suitable leaders for yeast host cells are obtained from
the genes for Saccharomyces cerevisiae enolase (ENO-1),
Saccharomyces cerevisiae 3-phosphoglycerate kinase,
Saccharomyces cerevisiae alpha-factor, and Saccharomyces
cerevisiae alcohol dehydrogenase/glyceraldehyde-3-phosphate
dehydrogenase (ADH2/GAP).

The control sequence may also be a polyadenylation
sequence, a sequence operably linked to the 3' terminus of the
nucleotide sequence and which, when transcribed, is recognized
by the host cell as a signal to add polyadenosine residues to
transcribed mRNA. Any polyadenylation sequence which is
functional in the host cell of choice may be used in the
present invention.

Preferred polyadenylation sequences for filamentous
fungal host cells are obtained from the genes for Aspergillus
oryzae TAKA amylase, Aspergillus niger glucoamylase,
Aspergillus nidulans anthranilate synthase, Fusarium oxysporum
trypsin-like protease, and Aspergillus niger alpha-glucosidase.

Useful polyadenylation sequences for yeast host cells are
described by Guo and Sherman, 1995, Molecular Cellular Biology
15: 5983-5990.

The control sequence may also be a signal peptide coding
region that codes for an amino acid sequence linked to the
amino terminus of a polypeptide and directs the encoded
polypeptide into the cell's secretory pathway. The 5' end of
the coding sequence of the nucleotide sequence may inherently
contain a signal peptide coding region naturally linked in
translation reading frame with the segment of the coding region
which encodes the secreted polypeptide. Alternatively, the 5'
end of the coding sequence may contain a signal peptide coding
region which is foreign to the coding sequence. The foreign
signal peptide coding region may be required where the coding
sequence does not naturally contain a signal peptide coding
region. Alternatively, the foreign signal peptide coding
region may simply replace the natural signal peptide coding
region in order to enhance secretion of the polypeptide.
However, any signal peptide coding region which directs the
expressed polypeptide into the secretory pathway of a host cell
of choice may be used in the present invention.

Effective signal peptide coding regions for bacterial
host cells are the signal peptide coding regions obtained from
the genes for Bacillus NCIB 11837 maltogenic amylase, Bacillus
stearothermophilus alpha-amylase, Bacillus licheniformis
subtilisin, Bacillus licheniformis beta-lactamase, Bacillus
stearothermophilus neutral proteases (nprT, nprS, nprM), and
Bacillus subtilis prsA. Further signal peptides are described
by Simonen and Palva, 1993, Microbiological Reviews 57:
109-137.

Effective signal peptide coding regions for filamentous
fungal host cells are the signal peptide coding regions
obtained from the genes for Aspergillus oryzae TAKA amylase,
Aspergillus niger neutral amylase, Aspergillus niger
glucoamylase, Rhizomucor miehei aspartic proteinase, Humicola
insolens cellulase, and Humicola lanuginosa lipase.

Useful signal peptides for yeast host cells are obtained
from the genes for Saccharomyces cerevisiae alpha-factor and
Saccharomyces cerevisiae invertase. Other useful signal
peptide coding regions are described by Romanos et al., 1992,
supra.

The control sequence may also be a propeptide coding
region that codes for an amino acid sequence positioned at the
amino terminus of a polypeptide. The resultant polypeptide is
known as a proenzyme or propolypeptide (or a zymogen in some
cases). A propolypeptide is generally inactive and can be
converted to a mature active polypeptide by catalytic or
autocatalytic cleavage of the propeptide from the
propolypeptide. The propeptide coding region may be obtained
from the genes for Bacillus subtilis alkaline protease (aprE),
Bacillus subtilis neutral protease (nprT), Saccharomyces
cerevisiae alpha-factor, Rhizomucor miehei aspartic proteinase,
and Myceliophthora thermophila laccase (WO 95/33836).

Where both signal peptide and propeptide regions are
present at the amino terminus of a polypeptide, the propeptide
region is positioned next to the amino terminus of a
polypeptide and the signal peptide region is positioned next to
the amino terminus of the propeptide region.
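
The ordering of the amino-terminal regions described above can be
illustrated with a minimal sketch, assuming each region is held as
a plain amino acid string. The function name and the toy sequences
are hypothetical and only show the order signal peptide,
propeptide, mature polypeptide.

```python
def assemble_precursor(mature, propeptide=None, signal_peptide=None):
    """Join optional N-terminal regions to a mature polypeptide.

    The signal peptide, when present, is placed at the extreme amino
    terminus; the propeptide, when present, sits directly next to the
    amino terminus of the mature polypeptide.
    """
    n_terminal_regions = [r for r in (signal_peptide, propeptide) if r]
    return "".join(n_terminal_regions) + mature


# Toy placeholder strings, not real protein sequences.
precursor = assemble_precursor("MATURE", propeptide="PRO", signal_peptide="SIG")
print(precursor)
```

With only a propeptide supplied, the same call simply returns the
propeptide followed by the mature sequence.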

It may also be desirable to add regulatory sequences
which allow the regulation of the expression of the polypeptide
relative to the growth of the host cell. Examples of
regulatory systems are those which cause the expression of the
gene to be turned on or off in response to a chemical or
physical stimulus, including the presence of a regulatory
compound. Regulatory systems in prokaryotic systems include
the lac, tac, and trp operator systems. In yeast, the ADH2
system or GAL1 system may be used. In filamentous fungi, the
TAKA alpha-amylase promoter, Aspergillus niger glucoamylase
promoter, and Aspergillus oryzae glucoamylase promoter may be
used as regulatory sequences. Other examples of regulatory
sequences are those which allow for gene amplification. In
eukaryotic systems, these include the dihydrofolate reductase
gene which is amplified in the presence of methotrexate, and
the metallothionein genes which are amplified with heavy
metals. In these cases, the nucleotide sequence encoding the
polypeptide would be operably linked with the regulatory
sequence.

Accordingly, a preferred embodiment relates to a method of
the first aspect, wherein additional steps are performed, said
additional steps comprising:
f) generating a gene library by cloning the plurality of
   recombined polynucleotides;
g) expressing and screening the gene library for an activity
   or property of interest; and
h) isolating or identifying the recombined polynucleotide
   which gives rise to the activity or property of interest.
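
Steps g) and h) amount to measuring each library member and keeping
those that show the property of interest. A minimal sketch of that
selection is given here, assuming the screening readout is
available as one number per clone. The clone names, the activity
values and the threshold are hypothetical.

```python
def select_hits(activities, threshold):
    """Return clone identifiers whose activity meets the threshold, best first."""
    hits = [clone for clone, value in activities.items() if value >= threshold]
    return sorted(hits, key=lambda clone: activities[clone], reverse=True)


# Hypothetical screening readout: clone identifier -> activity (arbitrary units).
library_readout = {"clone_A": 0.02, "clone_B": 1.35, "clone_C": 0.00, "clone_D": 0.87}
hits = select_hits(library_readout, threshold=0.5)
print(hits)
```

The clones returned would then be the ones carried forward for
isolation or sequencing of the recombined polynucleotide.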

The present invention also relates to recombinant expression
vectors comprising the polynucleotides of the invention,
especially when those are comprised in a nucleic acid construct
such as an expression vector. The various nucleotide and
control sequences described above may be joined together to
produce a recombinant expression vector which may include one
or more convenient restriction sites to allow for insertion or
substitution of the polynucleotide sequence at such sites.
Alternatively, a polynucleotide sequence of the present
invention may be expressed by inserting the nucleotide sequence
or a nucleic acid construct comprising the sequence into an
appropriate vector for expression. In creating the expression
vector, the coding sequence is located in the vector so that
the coding sequence is operably linked with the appropriate
control sequences for expression.

The recombinant expression vector may be any vector
(e.g., a plasmid or virus) which can be conveniently subjected
to recombinant DNA procedures and can bring about the
expression of the nucleotide sequence. The choice of the
vector will typically depend on the compatibility of the vector
with the host cell into which the vector is to be introduced.
The vectors may be linear or closed circular plasmids.

The vector may be an autonomously replicating vector,
i.e., a vector which exists as an extrachromosomal entity, the
replication of which is independent of chromosomal replication,
e.g., a plasmid, an extrachromosomal element, a minichromosome,
or an artificial chromosome.

The vector may contain any means for assuring
self-replication. Alternatively, the vector may be one which, when
introduced into the host cell, is integrated into the genome
and replicated together with the chromosome(s) into which it
has been integrated. Furthermore, a single vector or plasmid
or two or more vectors or plasmids which together contain the
total DNA to be introduced into the genome of the host cell, or
a transposon may be used.

The vectors of the present invention preferably contain
one or more selectable markers which permit easy selection of
transformed cells. A selectable marker is a gene the product
of which provides for biocide or viral resistance, resistance
to heavy metals, prototrophy to auxotrophs, and the like.
Examples of bacterial selectable markers are the dal genes
from Bacillus subtilis or Bacillus licheniformis, or markers
which confer antibiotic resistance such as ampicillin,
kanamycin, chloramphenicol or tetracycline resistance.
Suitable markers for yeast host cells are ADE2, HIS3, LEU2,
LYS2, MET3, TRP1, and URA3. Selectable markers for use in a
filamentous fungal host cell include, but are not limited to,
amdS (acetamidase), argB (ornithine carbamoyltransferase), bar
(phosphinothricin acetyltransferase), hygB (hygromycin
phosphotransferase), niaD (nitrate reductase), pyrG
(orotidine-5'-phosphate decarboxylase), sC (sulfate
adenyltransferase), trpC (anthranilate synthase), as well as
equivalents thereof.

Preferred for use in an Aspergillus cell are the amdS and
pyrG genes of Aspergillus nidulans or Aspergillus oryzae and
the bar gene of Streptomyces hygroscopicus.

The vectors of the present invention preferably contain
an element(s) that permits stable integration of the vector
into the host cell's genome or autonomous replication of the
vector in the cell independent of the genome.

For integration into the host cell genome, the vector may
rely on the nucleotide sequence encoding the polypeptide or any
other element of the vector for stable integration of the
vector into the genome by homologous or nonhomologous
recombination.

Alternatively, the vector may contain additional
nucleotide sequences for directing integration by homologous
recombination into the genome of the host cell. The additional
nucleotide sequences enable the vector to be integrated into
the host cell genome at a precise location(s) in the
chromosome(s).

To increase the likelihood of integration at a precise
location, the integrational elements should preferably contain
a sufficient number of nucleotides, such as 100 to 1,500 base
pairs, preferably 400 to 1,500 base pairs, and most preferably
800 to 1,500 base pairs, which are highly homologous with the
corresponding target sequence to enhance the probability of
homologous recombination. The integrational elements may be any
sequence that is homologous with the target sequence in the
genome of the host cell. Furthermore, the integrational
elements may be non-encoding or encoding nucleotide sequences.
On the other hand, the vector may be integrated into the genome
of the host cell by non-homologous recombination.
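
The nested length ranges given above for the integrational elements
can be expressed as a small classification helper. This is only a
sketch of the stated ranges. The function name and the tier labels
are hypothetical, and the degree of homology, which the text also
requires, is not checked.

```python
def homology_tier(length_bp):
    """Classify an integrational element by length, using the ranges in the text."""
    if 800 <= length_bp <= 1500:
        return "most preferred"
    if 400 <= length_bp <= 1500:
        return "preferred"
    if 100 <= length_bp <= 1500:
        return "suitable"
    return "outside the stated range"


for length in (50, 250, 600, 1200):
    print(length, "bp:", homology_tier(length))
```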

For autonomous replication, the vector may further
comprise an origin of replication enabling the vector to
replicate autonomously in the host cell in question. Examples
of bacterial origins of replication are the origins of
replication of plasmids pBR322, pUC19, pACYC177, and pACYC184
permitting replication in E. coli, and pUB110, pE194, pTA1060,
and pAMβ1 permitting replication in Bacillus. Examples of
origins of replication for use in a yeast host cell are the 2
micron origin of replication, ARS1, ARS4, the combination of
ARS1 and CEN3, and the combination of ARS4 and CEN6. An example
of a filamentous fungal stabilizing element is the AMA1
sequence. The origin of replication may be one having a
mutation which makes its functioning temperature-sensitive in
the host cell (see, e.g., Ehrlich, 1978, Proceedings of the
National Academy of Sciences USA 75: 1433).

More than one copy of a nucleotide sequence of the
present invention may be inserted into the host cell to
increase production of the gene product. An increase in the
copy number of the nucleotide sequence can be obtained by
integrating at least one additional copy of the sequence into
the host cell genome or by including an amplifiable selectable
marker gene with the nucleotide sequence where cells containing
amplified copies of the selectable marker gene, and thereby
additional copies of the nucleotide sequence, can be selected
for by cultivating the cells in the presence of the appropriate
selectable agent.

The procedures used to ligate the elements described
above to construct the recombinant expression vectors of the
present invention are well known to one skilled in the art
(see, e.g., Sambrook et al., 1989, supra).

The present invention also relates to a recombinant host
cell comprising the polynucleotide(s) or nucleic acid
construct(s) of the invention, which are advantageously used in
the screening assays described herein. A vector comprising a
nucleotide sequence of the present invention is introduced into
a host cell so that the vector is maintained as a chromosomal
integrant or as a self-replicating extra-chromosomal vector as
described earlier.

The host cell may be a unicellular microorganism, e.g., a
prokaryote, or a non-unicellular microorganism, e.g., a
eukaryote.

Useful unicellular cells are bacterial cells such as gram
positive bacteria including, but not limited to, a Bacillus
cell, e.g., Bacillus alkalophilus, Bacillus amyloliquefaciens,
Bacillus brevis, Bacillus circulans, Bacillus clausii, Bacillus
coagulans, Bacillus lautus, Bacillus lentus, Bacillus
licheniformis, Bacillus megaterium, Bacillus
stearothermophilus, Bacillus subtilis, and Bacillus
thuringiensis; or a Streptomyces cell, e.g., Streptomyces
lividans or Streptomyces murinus, or gram negative bacteria
such as E. coli and Pseudomonas sp.

In a preferred embodiment, the bacterial host cell is a
Bacillus lentus, Bacillus licheniformis, Bacillus
stearothermophilus, or Bacillus subtilis cell. In another
preferred embodiment, the Bacillus cell is an alkalophilic
Bacillus.

The introduction of a vector into a bacterial host cell
may, for instance, be effected by protoplast transformation
(see, e.g., Chang and Cohen, 1979, Molecular General Genetics
168: 111-115), using competent cells (see, e.g., Young and
Spizizin, 1961, Journal of Bacteriology 81: 823-829, or Dubnau
and Davidoff-Abelson, 1971, Journal of Molecular Biology 56:
209-221), electroporation (see, e.g., Shigekawa and Dower,
1988, Biotechniques 6: 742-751), or conjugation (see, e.g.,
Koehler and Thorne, 1987, Journal of Bacteriology 169:
5771-5278).

The host cell may be a eukaryote, such as a mammalian,
insect, plant, or fungal cell.

In a preferred embodiment, the host cell is a fungal
cell. "Fungi" as used herein includes the phyla Ascomycota,
Basidiomycota, Chytridiomycota, and Zygomycota (as defined by
Hawksworth et al., In, Ainsworth and Bisby's Dictionary of The
Fungi, 8th edition, 1995, CAB International, University Press,
Cambridge, UK) as well as the Oomycota (as cited in Hawksworth
et al., 1995, supra, page 171) and all mitosporic fungi
(Hawksworth et al., 1995, supra).

In a more preferred embodiment, the fungal host cell is a
yeast cell. "Yeast" as used herein includes ascosporogenous
yeast (Endomycetales), basidiosporogenous yeast, and yeast
belonging to the Fungi Imperfecti (Blastomycetes). Since the
classification of yeast may change in the future, for the
purposes of this invention, yeast shall be defined as described
in Biology and Activities of Yeast (Skinner, F.A., Passmore,
S.M., and Davenport, R.R., eds, Soc. App. Bacteriol. Symposium
Series No. 9, 1980).

In an even more preferred embodiment, the yeast host cell
is a Candida, Hansenula, Kluyveromyces, Pichia, Saccharomyces,
Schizosaccharomyces, or Yarrowia cell.

In a most preferred embodiment, the yeast host cell is a
Saccharomyces carlsbergensis, Saccharomyces cerevisiae,
Saccharomyces diastaticus, Saccharomyces douglasii,
Saccharomyces kluyveri, Saccharomyces norbensis or
Saccharomyces oviformis cell. In another most preferred
embodiment, the yeast host cell is a Kluyveromyces lactis cell.
In another most preferred embodiment, the yeast host cell is a
Yarrowia lipolytica cell.

In another more preferred embodiment, the fungal host
cell is a filamentous fungal cell. "Filamentous fungi" include
all filamentous forms of the subdivision Eumycota and Oomycota
(as defined by Hawksworth et al., 1995, supra). The
filamentous fungi are characterized by a mycelial wall composed
of chitin, cellulose, glucan, chitosan, mannan, and other
complex polysaccharides. Vegetative growth is by hyphal
elongation and carbon catabolism is obligately aerobic. In
contrast, vegetative growth by yeasts such as Saccharomyces
cerevisiae is by budding of a unicellular thallus and carbon
catabolism may be fermentative.

In an even more preferred embodiment, the filamentous
fungal host cell is a cell of a species of, but not limited to,
Acremonium, Aspergillus, Fusarium, Humicola, Mucor,
Myceliophthora, Neurospora, Penicillium, Thielavia,
Tolypocladium, or Trichoderma.

In a most preferred embodiment, the filamentous fungal
host cell is an Aspergillus awamori, Aspergillus foetidus,
Aspergillus japonicus, Aspergillus nidulans, Aspergillus niger
or Aspergillus oryzae cell. In another most preferred
embodiment, the filamentous fungal host cell is a Fusarium
bactridioides, Fusarium cerealis, Fusarium crookwellense,
Fusarium culmorum, Fusarium graminearum, Fusarium graminum,
Fusarium heterosporum, Fusarium negundi, Fusarium oxysporum,
Fusarium reticulatum, Fusarium roseum, Fusarium sambucinum,
Fusarium sarcochroum, Fusarium sporotrichioides, Fusarium
sulphureum, Fusarium torulosum, Fusarium trichothecioides, or
Fusarium venenatum cell. In an even most preferred embodiment,
the filamentous fungal parent cell is a Fusarium venenatum
(Nirenberg sp. nov.) cell. In another most preferred
embodiment, the filamentous fungal host cell is a Humicola
insolens, Humicola lanuginosa, Mucor miehei, Myceliophthora
thermophila, Neurospora crassa, Penicillium purpurogenum,
Thielavia terrestris, Trichoderma harzianum, Trichoderma
koningii, Trichoderma longibrachiatum, Trichoderma reesei, or
Trichoderma viride cell.

Fungal cells may be transformed by a process involving
protoplast formation, transformation of the protoplasts, and
regeneration of the cell wall in a manner known per se.
Suitable procedures for transformation of Aspergillus host
cells are described in EP 238 023 and Yelton et al., 1984,
Proceedings of the National Academy of Sciences USA 81:
1470-1474. Suitable methods for transforming Fusarium species are
described by Malardier et al., 1989, Gene 78: 147-156 and WO
96/00787. Yeast may be transformed using the procedures
described by Becker and Guarente, In Abelson, J.N. and Simon,
M.I., editors, Guide to Yeast Genetics and Molecular Biology,
Methods in Enzymology, Volume 194, pp 182-187, Academic Press,
Inc., New York; Ito et al., 1983, Journal of Bacteriology 153:
163; and Hinnen et al., 1978, Proceedings of the National
Academy of Sciences USA 75: 1920.

A plurality of recombined polynucleotides of the second
aspect generated by the method of the first aspect may be screened
for a particular activity or property of interest, and a
recombined polynucleotide of the third aspect generated by the
method of the first aspect may be selected from the plurality
of the second aspect based on the results of such a screening.

An essential element in this screening process is the use
of a plurality of recombined polynucleotides of the second aspect
generated by a method of the first aspect, in a screening assay
for an activity or property of interest.

Examples
Example 1

Construction of an expression plasmid

Plasmids pENI1298 and pENI1299 were described in WO
00/24883. The plasmid pENI1298 was further developed in order
to decrease plasmid size (thus improving transformation), improve
expression (by improving the promoter), obtain expression in
E. coli for improved library screening (by improving the same
promoter), ease cloning (by introducing unique restriction sites
downstream of the promoter), and further ease cloning (by using
the Gateway cloning technology).

Plasmid pENI1960 was made using the Gateway Vector™
conversion system (Lifetechnology™ cat. no. 11828-019) by cutting
pENI1902 with BamHI, filling the DNA ends using Klenow fragment
polymerase and nucleotides (thus making blunt ends) followed by
ligation to the reading frame A Gateway™ PCR fragment. The cloning
in the correct orientation was confirmed by sequencing.

Plasmid pENI1902 was made in order to have a promoter
that works in both E. coli and Aspergillus. This was done by
unique site elimination using the "Chameleon double stranded
site-directed mutagenesis kit" as recommended by Stratagene™.
Plasmid pENI1861 was used as template and the following primers
were used as selection primers ("5'P" indicates a 5'
phosphorylation of the primer):

177996 (SEQ ID 1): 5'P GAATGACTTGGTTGACGCGTCACCAGTCAC

135640 (SEQ ID 2): 5'P CTTATTAGTAGGTTGGTACTTCGAG

135638 (SEQ ID 3): 5'P GTCCCCAGAGTAGTGTCACTATGTCGAGGCAGTTAAG

The 080399J19 primer was used as mutagenic primer to introduce
a -35 and -10 promoter consensus sequence (from E. coli) in the
Aspergillus expression promoter. Introduction of the mutations
was verified by sequencing.

080399J19 (SEQ ID 4):
5'P GTATGTCCCTTGACAATGCGATGTATCACATGATATAATTACTAGCAAGGGAAGCCGTGCTTGG

Plasmid pENI1861 was made in order to have the state of
the art Aspergillus promoter in the expression plasmid, as well
as a number of unique restriction sites for cloning.
A PCR fragment (app. 620 bp) was made using pMT2188 (see
Example 5) as template and the following primers:

051199J1 (SEQ ID 5):
5' CCTCTAGATCTCGAGCTCGGTCACCGGTGGCCTCCGCGGCCGCTGGATCCCCAGTTGTG

1298TAKA (SEQ ID 6): 5' GCAAGCGCGCGCAATACATGGTGTTTTGATCAT

The fragment was cut with BssHII and BglII, and cloned into
pENI1849 which was also cut with BssHII and BglII. The cloning
was verified by sequencing.

Plasmid pENI1849 was made in order to truncate the pyrG
gene to the essential sequences for pyrG expression, in order
to decrease the size of the plasmid, thus improving
transformation frequency. A PCR fragment (app. 1800 bp) was
made using pENI1299 as template and the following primers:

270999J8 (SEQ ID 7): 5' TCTGTGAGGCCTATGGATCTCAGAAC

270999J9 (SEQ ID 8): 5' GATGCTGCATGCACAACTGCACCTCAG

The PCR fragment was cut with the restriction enzymes
StuI and SphI, and cloned into pENI1298, also cut with StuI and
SphI; the cloning was verified by sequencing.

Example 2

Preparation of a cell extract comprising MutS mismatch
repair enzymes from Thermus aquaticus ATCC25105.

The cell extract is made as described by O'Grady G.M.,
McCarthy T.V., Vaughan P.M. (1997) Biochemical Society
Transactions vol 25 p. 319-322.

Example 3

Three primers are made based on the Thermomyces
lanuginosa lipase gene (EMBL accession no.: AF054513), where
each primer contains two in-frame stop codons (shown below in
bold typeface) and a silent mutation (shown below in underlined
italic typeface). In order to have a functional lipase gene,
recombination has to take place between all three primers
during the Recombinatorial Chain Reaction (RCR) method of the
invention. After RCR the primers are subjected to a round of
PCR and a cloning step.

The following reaction is mixed to a total of 25 µl:
10 µl of cell extract, 40 pmol of primer 251000j2, 40 pmol of
primer 251000j3, 40 pmol of primer 251000j4 in buffer (20 mM
Tris/HCl pH 7.5, 5 mM MgCl2, 0.1 mM Dithiothreitol, 0.1 mM
EDTA).
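
The amounts given for the reaction can be converted into final
concentrations. A minimal sketch follows, relying only on the
identity that 1 pmol per µl equals 1 µM. The function name is
hypothetical.

```python
def micromolar(amount_pmol, volume_ul):
    """Convert an amount in pmol and a volume in µl to a concentration in µM."""
    return amount_pmol / volume_ul


# Each primer: 40 pmol in the 25 µl reaction described above.
primer_um = micromolar(40, 25)
print(f"each primer: {primer_um:.1f} µM")
```

Under these figures each primer is present at 1.6 µM, and the cell
extract makes up 10/25, i.e. 40 %, of the reaction volume.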

251000j2 (SEQ ID 9):

5' ATCGGGAATCTTAACTTCGACTAGTAAGAAATTTU^TGACATTTGCTCC

251000j3 (SEQ ID 10):

5' ATCGGGAATCTTAACTTCGACTAGAAATAAAT3AATGACATTTGCTCC

251000j4rev (SEQ ID 11):

5' GGAGCAAATGTCATTAATTTATTACAAGTCGAAGTTAAGATTCCCGAT
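
Why recombination between all three primers is needed can be
checked directly on the listed sequences. The sketch below is an
illustration under stated assumptions: the reading frame is taken
to start at the first base of the forward primers, only the first
30 nucleotides of 251000j2 and 251000j3 are used because later
positions are not clearly legible in this text, and 251000j4rev is
compared as its reverse complement. The helper names are
hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Transcribed from the primer listing; forward primers truncated to 30 nt.
PRIMER_J2_HEAD = "ATCGGGAATCTTAACTTCGACTAGTAAGAA"
PRIMER_J3_HEAD = "ATCGGGAATCTTAACTTCGACTAGAAATAA"
PRIMER_J4_REV = "GGAGCAAATGTCATTAATTTATTACAAGTCGAAGTTAAGATTCCCGAT"


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def codons(seq, frame=0):
    """Split a sequence into complete codons, starting at the given frame."""
    return [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]


def sense_donors(variants):
    """For each codon position where any variant has a stop codon,
    list the variants that carry a non-stop codon at that position."""
    length = min(len(c) for c in variants.values())
    donors = {}
    for i in range(length):
        column = {name: c[i] for name, c in variants.items()}
        if any(codon in STOP_CODONS for codon in column.values()):
            donors[i] = sorted(
                name for name, codon in column.items() if codon not in STOP_CODONS
            )
    return donors


variants = {
    "251000j2": codons(PRIMER_J2_HEAD),
    "251000j3": codons(PRIMER_J3_HEAD),
    "251000j4rev": codons(reverse_complement(PRIMER_J4_REV)),
}
donors = sense_donors(variants)
for index, names in donors.items():
    print(f"codon {index + 1}: stop-free in {', '.join(names)}")
```

Under these assumptions, each of the three codon positions that
carries a stop codon is stop-free in exactly one of the three
primers, and in a different primer each time, which is consistent
with the statement that a functional gene requires recombination
between all three.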


The reaction mix is placed in a PCR machine (e.g. Perkin
Elmer™ 2400) and the following Recombinatorial Chain Reaction
(RCR) cycle is run: 94°C, 5 min; 30 times (94°C, 30 sec; 45°C, 1
min; 72°C, 3 min). After the RCR reaction, the mix is purified
by being spun on BioRad™ Columns (Micro Bio-spin P-6™; BioRad™).

Two PCR reactions as listed below are run with the
following protocol: 94°C, 5 min; 30 cycles of (94°C, 30 sec;
50°C, 1 min; 72°C, 2 min); 72°C, 5 min.
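
The two temperature programs can be compared by summing their hold
times. The sketch below assumes ideal hold times only; ramping
between temperatures, which depends on the instrument, is ignored.
The function name is hypothetical.

```python
def program_minutes(initial_min, cycles, step_minutes, final_min=0.0):
    """Total hold time of a cycling program, in minutes (ramp times excluded)."""
    return initial_min + cycles * sum(step_minutes) + final_min


# RCR: 94°C 5 min; 30 x (94°C 30 sec, 45°C 1 min, 72°C 3 min)
rcr_total = program_minutes(5, 30, [0.5, 1, 3])

# PCR: 94°C 5 min; 30 x (94°C 30 sec, 50°C 1 min, 72°C 2 min); 72°C 5 min
pcr_total = program_minutes(5, 30, [0.5, 1, 2], final_min=5)

print(f"RCR: {rcr_total:.0f} min, PCR: {pcr_total:.0f} min")
```

On hold times alone the RCR program runs for 140 min and the PCR
program for 115 min.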

PCR1 (app. 420 bp):

Use pENI1298 as template, 20 pmol primer 19072000J1 and 1
µl of the purified RCR extract in a total of 20 µl, along with
PWO™ DNA polymerase (Boehringer Mannheim™).

Primer 19072000J1 (SEQ ID 12):

5' GGGGACAAGTTTGTACAAAAAAGCAGGCTTCTCTGAACAATAAACCCCAC

PCR2 (app. 600 bp):

Use pENI1298 as template, 20 pmol primer 16062000j2 and 1
µl of the purified RCR extract in a total of 20 µl, along with
PWO™ DNA polymerase (Boehringer Mannheim™).

Primer 16062000j2 (SEQ ID 13):

5' GGGGACCACTTTGTACAAGAAAGCTGGGTCCTAGATCTCGAGCTCGGTCAC

The PCR fragments PCR1 and PCR2 are purified from a 1.5%
agarose gel using a Qiagen™ gel extraction kit. A third PCR (PCR3)
is run using 1 µl of each purified PCR fragment, 20 pmol
primer 16062000j2, 20 pmol primer 19072000J1 in a total of 20
µl, along with PWO™ polymerase (Boehringer Mannheim™). The PCR
fragment PCR3 is purified using BioRad™ Columns (Micro Bio-spin
P-6™).

The BP cloning reaction is made by mixing the PCR3
fragment with BP reaction buffer, pDONR201 vector and BP
clonase mix (as recommended by Lifetechnology™). Plasmid
pENI1960 is cut with ScaI (in order to cleave in the ccdB
gene), and mixed with the BP reaction and LR clonase mix (as
recommended by Lifetechnology™) and transformed into E. coli
DH10B (Life-Technology™ cat. no. 18290-015).

The transformation is plated onto A+B minimal plates
(Clark and Maaloe (1967) J. Mol Biol. Vol 23 p. 99-112)
supplemented with 1% olive oil, ampicillin (100 µg/ml) and 25
µg/ml leucine. Only those E. coli transformants that have the
plasmid and express a functional lipase will grow on these
plates.
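
The supplement amounts for a batch of selection medium follow from
the stated concentrations. The sketch below assumes that the 1%
olive oil is volume per volume, which the text does not specify.
The function name and the 500 ml batch size are hypothetical.

```python
def supplements_for(volume_ml):
    """Amounts of each supplement for a given volume of A+B minimal medium.

    Assumes 1% olive oil (v/v), 100 µg/ml ampicillin and 25 µg/ml leucine.
    """
    return {
        "olive_oil_ml": volume_ml / 100,          # 1% of the volume
        "ampicillin_mg": volume_ml * 100 / 1000,  # µg converted to mg
        "leucine_mg": volume_ml * 25 / 1000,      # µg converted to mg
    }


batch = supplements_for(500)
for name, amount in batch.items():
    print(f"{name}: {amount}")
```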

A DNA preparation is made from each transformant and
sequenced to confirm the functionality of the lipase gene, as
well as to identify the silent mutation originating from the
primers, thus confirming that the recombination event
during the RCR has taken place as expected.

Example 4

Three primers are made based on the Thermomyces
lanuginosa lipase gene. Each primer contains 2 in-frame
stop codons (shown below in bold typeface) and a silent mutation
(shown below in underlined italic typeface). In order to have a
functional lipase gene, recombination has to take place between
all three primers during the Recombinatorial Chain Reaction
(RCR) method of the invention. After RCR the primers are
subjected to a round of PCR and a cloning step.

The following reaction is mixed in a total of 25 µl:
10 µl of cell extract, 40 pmol of primer 251000j2, 40 pmol of
primer 251000j3, 40 pmol of primer 251000j4 in buffer (20 mM
Tris/HCl pH 7.5, 5 mM MgCl2, 0.1 mM Dithiothreitol, 0.1 mM
EDTA).

The reaction mix is placed in a PCR machine (e.g. Perkin
Elmer™ 2400) and the following Recombinatorial Chain Reaction
(RCR) cycle is run: 94°C, 5 min; 30 times (94°C, 30 sec; 45°C, 1
min; 72°C, 3 min). After the RCR reaction, the mix is purified
by being spun on BioRad™ Columns (Micro Bio-spin P-6™; BioRad™).

Two PCR reactions as listed below are run with the
following protocol: 94°C, 5 min; 30 cycles of (94°C, 30 sec;
50°C, 1 min; 72°C, 2 min); 72°C, 5 min.

PGR 1 (app. 420 bp) : 

Use pENI1298 as template, 20 pmol primer 19671 and 1 /il 
of the purified RCR extract in a total of 20 fxl , along with 
20 PWO™ DNA polymerase (Boehringer Mannheim™) . 

Primer 19671 (SEQ ID 14) : 5' CTCCCTTCTCTGAACAATAAACCC 

PCR 2 (approx. 600 bp):

Use pENI1298 as template, 20 pmol primer 991213J5 and 1 µl
of the purified RCR extract in a total of 20 µl, along with
PWO™ DNA polymerase (Boehringer Mannheim™).

Primer 991213J5 (SEQ ID 15):
5' CCTCTAGATCTCGAGCTCGGTCACCGGTGGCCTCCGCGGCCGCTGCGCCAGGTGTCAGTC
ACCCTC

The two PCR fragments PCR1 and PCR2 are purified from a
1.5% agarose gel using a Qiagen™ gel extraction kit. A third PCR
(PCR3) is run using 1 µl of each purified PCR fragment, PCR1
and PCR2, 100 pmol primer 19671, 100 pmol primer 991213J5 in a
total of 100 µl, along with PWO™ polymerase (Boehringer
Mannheim™). After the reaction, the PCR3 fragment is purified
from the mix by being spun on BioRad™ columns (Micro Bio-Spin
P-6™).

The PCR fragment PCR3 and pENI1902 are both cut with BamHI
and SacII and the cut fragments are purified from a 1% agarose
gel using a Qiagen™ gel extraction kit. The fragment and the
pENI1902 vector are ligated and transformed into E. coli DH10B.
The transformation is plated onto A+B minimal plates
(Clark and Maaloe (1967) J. Mol. Biol. Vol. 23 p. 99-112)
supplemented with 1% olive oil, 100 µg/ml ampicillin and 25
µg/ml leucine. Only those E. coli transformants that have the
plasmid and express a functional lipase will grow on these
plates.

A DNA preparation is made from each transformant and
sequenced to confirm the functionality of the lipase gene, as
well as to identify the silent mutation originating from the
primers, thus confirming that the recombination event
during the RCR has taken place as expected.

Example 5

Construction of the expression plasmid pMT2188 

The Aspergillus oryzae expression plasmid pCaHj483 (WO
98/00529) consists of an expression cassette based on the
Aspergillus niger neutral amylase II promoter fused to the
Aspergillus nidulans triose phosphate isomerase non-translated
leader sequence (Pna2/tpi) and the A. niger amyloglycosidase
terminator (Tamg). Also present on the plasmid is the
Aspergillus selective marker amdS from A. nidulans, enabling
growth on acetamide as sole nitrogen source. These elements are
cloned into the E. coli vector pUC19 (New England Biolabs). The
ampicillin resistance marker enabling selection in E. coli of
this plasmid was replaced with the URA3 marker of Saccharomyces
cerevisiae, which can complement a pyrF mutation in E. coli. The
replacement was done in the following way:

The pUC19 origin of replication was PCR amplified from
pCaHj483 with the primers:

142779 (SEQ ID 16): 5' TTGAATTGAAAATAGATTGATTTAAAACTTC
142780 (SEQ ID 17): 5' TTGCATGCGTAATCATGGTCATAGC

Primer 142780 introduces a BbuI site in the PCR fragment.
The Expand PCR system (Roche Molecular Biochemicals, Basel,
Switzerland) was used for the amplification following the
manufacturer's instructions for this and the subsequent PCR
amplifications.

The URA3 gene was amplified from the general S.
cerevisiae cloning vector pYES2 (Invitrogen Corporation,
Carlsbad, CA, USA) using the primers:

140288 (SEQ ID 18): 5' TTGAATTCATGGGTAATAACTGATAT
142778 (SEQ ID 19): 5' AAATCAATCTATTTTCAATTCAATTCATCATT

Primer 140288 introduces an EcoRI site in the PCR
fragment. The two PCR fragments were fused by mixing them and
amplifying using the primers 142780 and 140288 in the splicing
by overlap method (Horton et al. (1989) Gene, 77, 61-68).
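The primer design behind this fusion can be sanity-checked with a short sketch. It uses the primer sequences as printed in the sequence listing (SEQ ID 16 to 19) and assumes GCATGC as the BbuI recognition sequence (BbuI is an SphI isoschizomer) and GAATTC for EcoRI. The helper names are illustrative.

```python
# Sketch: check the restriction sites carried by the outer primers and the
# overlap between the two inner primers used for splicing by overlap.
# Assumptions: sequences from SEQ ID 16-19; BbuI site taken as GCATGC.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def longest_suffix_prefix_overlap(a, b):
    """Length of the longest suffix of a that is also a prefix of b."""
    for k in range(min(len(a), len(b)), 0, -1):
        if a.endswith(b[:k]):
            return k
    return 0


P142779 = "TTGAATTGAAAATAGATTGATTTAAAACTTC"
P142780 = "TTGCATGCGTAATCATGGTCATAGC"
P140288 = "TTGAATTCATGGGTAATAACTGATAT"
P142778 = "AAATCAATCTATTTTCAATTCAATTCATCATT"

has_bbui = "GCATGC" in P142780
has_ecori = "GAATTC" in P140288

# The end of the URA3 fragment (primed by 142778) should match the start
# of the origin fragment (primed by 142779) so the two can be fused.
overlap = longest_suffix_prefix_overlap(reverse_complement(P142778), P142779)

print(has_bbui, has_ecori, overlap)
```

With the listed sequences the two inner primers share a 23-nucleotide complementary stretch, which is what lets the two fragments prime on each other when amplified with the outer primers 142780 and 140288.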

The resulting fragment was digested with EcoRI and BbuI
and ligated to the largest fragment of pCaHj483 digested with
the same enzymes. The ligation mixture was used to transform
the pyrF- E. coli strain DB6507 (ATCC 35673) made competent by
the method of Mandel and Higa (Mandel, M. and A. Higa (1970) J.
Mol. Biol. 45, 154). Transformants were selected on solid M9
medium (Sambrook et al. (1989) Molecular Cloning, a Laboratory
Manual, 2nd edition, Cold Spring Harbor Laboratory Press)
supplemented with 1 g/l casamino acids, 500 µg/l thiamine and 10
mg/l kanamycin.

A plasmid from a selected transformant was termed
pCaHj527. The Pna2/tpi promoter present on pCaHj527 was subjected to
site-directed mutagenesis by a simple PCR approach.

Nucleotides 134-144 were altered from GTACTAAAACC to
CCGTTAAATTT using the mutagenic primer 141223:

141223 (SEQ ID 20):
5' GGATGCTGTTGACTCCGGAAATTTAACGGTTTGGTCTTGCATCCC

Nucleotides 423-436 were altered from ATGCAATTTAAACT to
CGGCAATTTAACGG using the mutagenic primer 141222:

141222 (SEQ ID 21):
5' GGTATTGTCCTGCAGACGGCAATTTAACGGCTTCTGCGAATCGC

The resulting plasmid was termed pMT2188. 
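As an illustrative check, the two mutagenic primers can be searched for the replacement sequences stated above. The sketch assumes the primer and target sequences exactly as printed in this example; `site_orientation` is a hypothetical helper name.

```python
# Sketch: confirm that each mutagenic primer carries the intended new
# sequence, either directly or as its reverse complement.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def site_orientation(primer, site):
    """Report how `site` occurs in `primer`, or None if it is absent."""
    if site in primer:
        return "forward"
    if reverse_complement(site) in primer:
        return "reverse complement"
    return None


P141223 = "GGATGCTGTTGACTCCGGAAATTTAACGGTTTGGTCTTGCATCCC"
P141222 = "GGTATTGTCCTGCAGACGGCAATTTAACGGCTTCTGCGAATCGC"

checks = {
    "141223 new (134-144)": site_orientation(P141223, "CCGTTAAATTT"),
    "141223 old (134-144)": site_orientation(P141223, "GTACTAAAACC"),
    "141222 new (423-436)": site_orientation(P141222, "CGGCAATTTAACGG"),
    "141222 old (423-436)": site_orientation(P141222, "ATGCAATTTAAACT"),
}

for label, result in checks.items():
    print(label, "->", result)
```

With the sequences as printed, primer 141223 carries the new 134-144 sequence as its reverse complement and primer 141222 carries the new 423-436 sequence directly; neither primer contains the original sequence.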



CLAIMS

1. A method for forming a plurality of recombined homologous
double-stranded polynucleotides from at least two homologous
double-stranded template polynucleotides, said method
comprising the steps of:

a) providing a solution comprising at least two
non-methylated homologous double-stranded template
polynucleotides and one or more mismatch repair
protein(s);

b) denaturing the template polynucleotides into
single-stranded polynucleotides;

c) annealing the different single-stranded polynucleotides,
wherein heteroduplexes are formed;

d) allowing the mismatch repair protein(s) to repair
nucleotide mismatches in the heteroduplexes, wherein
recombined new duplexes are formed; and

e) optionally, repeating steps b) through d) for one or more
cycles; wherein the new duplexes of step d) serve as new
template polynucleotides in step b) in each subsequent
cycle.
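Steps b) through e) can be illustrated with a toy simulation. This is only a sketch under simplifying assumptions that are not part of the claim: strands pair at random, every mismatch is resolved independently toward one of the two strands with equal probability, and each duplex is represented by its sense-strand sequence. The function names and example sequences are hypothetical.

```python
# Toy model of the denature / anneal / repair cycle (not the claimed method itself).
import random


def repair_heteroduplex(top, bottom, rng):
    """Resolve each mismatch toward one strand chosen at random.

    Toy assumption: without methylation marks either strand may act
    as the repair template, independently at each mismatch.
    """
    return "".join(
        t if t == b else rng.choice((t, b))
        for t, b in zip(top, bottom)
    )


def rcr(templates, cycles, rng):
    """Return the duplex sequences after the given number of cycles."""
    duplexes = list(templates)
    for _ in range(cycles):
        # Step b: each duplex contributes two strands to the pool.
        strands = [seq for seq in duplexes for _ in range(2)]
        # Step c: random re-annealing into pairs.
        rng.shuffle(strands)
        # Step d: mismatch repair turns each pair into a new duplex.
        duplexes = [
            repair_heteroduplex(strands[i], strands[i + 1], rng)
            for i in range(0, len(strands), 2)
        ]
        # Step e: the new duplexes are the templates of the next cycle.
    return duplexes


parents = ["AAAATTTTCCCC", "AAGATTCTCCGC"]
library = rcr(parents, cycles=10, rng=random.Random(0))
print(library)
```

In this model every position of every output sequence is inherited from one of the parents, so new combinations of the parental differences can appear without introducing new point mutations.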

2. The method of claim 1, wherein the at least two
homologous double-stranded template polynucleotides are
obtained by PCR amplification.

3. The method of claims 1 or 2, wherein the at least two
homologous double-stranded template polynucleotides encode
homologous polypeptides.

4. The method of any of claims 1-3, wherein the at least
two homologous double-stranded template polynucleotides encode
homologous enzymes, preferably amylases, proteases, cellulases,
lipases, xylanases, or phospholipases.

5. The method of any of claims 1-4, wherein the solution
comprises a population of cells or a lysate of a population of
cells.

6. The method of claim 5, wherein the population of cells or
the lysate of a population of cells comprises the at least two
homologous double-stranded template polynucleotides.

7. The method of claims 5 or 6, wherein the population of
cells or the lysate of a population of cells comprises the
mismatch repair protein(s).

8. The method of any of claims 5-7, wherein the population
of cells, or the population of cells giving rise to the lysate,
does not methylate newly synthesized polynucleotides.

9. The method of any of claims 1-8, wherein the mismatch
repair protein(s) is (are) thermostable.

10. The method of any of claims 1-9, wherein the
thermostable mismatch repair protein(s) comprises a MutS
homologue, preferably MutS YT1 of Thermus aquaticus.

11. The method of any of claims 1-9, wherein the
thermostable mismatch repair protein(s) comprises a MutL
homologue, a MSH2 homologue, a MSH6 homologue, a MutM
homologue, a MutY homologue, a MutT homologue, a MutH
homologue, a HexA homologue, a HexB homologue, or a GTBP/p160
homologue.

12. The method of any of claims 1-11, wherein the
denaturing is achieved by increasing the temperature of the
solution, preferably to at least 90°C.

13. The method of claim 12, wherein the annealing is achieved
by lowering the temperature of the solution, preferably at
least to a temperature at which the mismatch repair protein(s)
functions, more preferably at least to between 55°C and 75°C.

14. The method of any of claims 1-13, wherein steps b)
through d) are repeated for between 1 and 10 cycles; wherein
the new duplexes of step d) serve as new template
polynucleotides in step b) in each subsequent cycle.

15. The method of any of claims 1-13, wherein steps b)
through d) are repeated for at least 10 cycles; wherein the new
duplexes of step d) serve as new template polynucleotides in
step b) in each subsequent cycle.



16. The method of any of claims 1-15, wherein additional
steps are performed, said additional steps comprising:

f) generating a gene library by cloning the plurality of
recombined polynucleotides;
g) expressing and screening the gene library for an activity
or property of interest; and
h) isolating or identifying the recombined polynucleotide
which gives rise to the activity or property of interest.

17. A plurality of recombined polynucleotides generated by a
method as defined in any of the claims 1-16.

18. A recombined polynucleotide generated by a method as 
defined in any of the claims 1-16. 


19. Use of a plurality of recombined polynucleotides 
generated by a method as defined in any of the claims 1-16, 
in a screening assay for an activity or property of interest. 



20. Use of a recombined polynucleotide generated by a method
as defined in any of the claims 1-16, for expression or
production of a polypeptide of interest.




ABSTRACT 

A method for making polynucleotide libraries that 
comprise new recombined polynucleotides by using mismatch 
repair enzymes or proteins to shuffle at least two variants of 
the same polynucleotide or at least two homologous 
polynucleotides through a Recombinatorial Chain Reaction, RCR.

SEQUENCE LISTING

<110> Novozymes A/S
Jesper, Vind

<120> Method for Producing a Polynucleotide Library
<130> 10070.000-DK
<160> 15

<170> PatentIn version 3.0



<210> 1 

<211> 30 

<212> DNA 

<213> Primer 177996 
<220> 

<221> misc_feature 

<222> (1)..(1)

<223> 5' Phosphorylation

<400> 1 

gaatgacttg gttgacgcgt caccagtcac 30 

<210> 2 

<211> 25 

<212> DNA 

<213> Primer 135640 
<220> 

<221> misc_feature 

<222> (1)..(1)

<223> 5' Phosphorylation 

<400> 2 

cttattagta ggttggtact tcgag 25 

<210> 3 

<211> 37 

<212> DNA 

<213> Primer 135638 
<220> 

<221> misc_feature

<222> (1)..(1) 

<223> 5' Phosphorylation 

<400> 3 

gtccccagag tagtgtcact atgtcgaggc agttaag 37 

<210> 4 

<211> 64 

<212> DNA 

<213> Primer 080399J19



<220> 

<221> misc_feature 
<222> (1)..(1)
<223> 5' Phosphorylation



<400> 4 

gtatgtccct tgacaatgcg atgtatcaca tgatataatt actagcaagg gaagccgtgc 60 
ttgg 64 

<210> 5 

<211> 59 

<212> DNA

<213> Primer 051199J1 

<400> 5 

cctctagatc tcgagctcgg tcaccggtgg cctccgcggc cgctggatcc ccagttgtg 59 

<210> 6 

<211> 33 

<212> DNA 

<213> Primer 1298TAKA 



<400> 6 

gcaagcgcgc gcaatacatg gtgttttgat cat 33 

<210> 7 

<211> 26 

<212> DNA 

<213> Primer 270999J8 



<400> 7 

tctgtgaggc ctatggatct cagaac 26 

<210> 8
<211> 27 
<212> DNA 

<213> Primer 270999J9 
<400> 8 

gatgctgcat gcacaactgc acctcag 27 



<210> 9 

<211> 48 

<212> DNA 

<213> Primer 251000j2
<220>

<221> misc_signal

<222> (22)..(27)

<223> Two stop-codons

<220>

<221> misc_signal

<222> (33)..(33)

<223> Silent mutation 



<400> 9 

atcgggaatc ttaacttcga ctagtaagaa attaatgaca tttgctcc 48 

<210> 10 

<211> 48 

<212> DNA 

<213> Primer 251000j3 






<220> 

<221> misc_signal

<222> (22)..(24)

<223> Stop-codon

<220>

<221> misc_signal

<222> (28)..(30)

<223> Stop-codon

<220>

<221> misc_signal

<222> (33)..(33)

<223> Silent mutation

<400> 10

atcgggaatc ttaacttcga ctagaaataa attaatgaca tttgctcc 48

<210> 11 

<211> 48 

<212> DNA 

<213> Primer 251000j4rev 
<220> 

<221> misc_signal 

<222> (16)..(16)

<223> Silent mutation

<220>

<221> misc_signal

<222> (19)..(24)

<223> Two stop-codons

<400> 11

ggagcaaatg tcattaattt attacaagtc gaagttaaga ttcccgat 48

<210> 12 

<211> 50 

<212> DNA 

<213> Primer 19072000J1 

<400> 12 

ggggacaagt ttgtacaaaa aagcaggctt ctctgaacaa taaaccccac 50

<210> 13 

<211> 51 

<212> DNA 

<213> Primer 16062000j2 

<400> 13 

ggggaccact ttgtacaaga aagctgggtc ctagatctcg agctcggtca 

<210> 14 

<211> 24

<212> DNA 

<213> Primer 19671 

<400> 14 

ctcccttctc tgaacaataa accc 24

<210> 15
<211> 66 
<212> DNA 

<213> Primer 991213J5 
<400> 15 

cctctagatc tcgagctcgg tcaccggtgg cctccgcggc cgctgcgcca ggtgtcagtc 60 
accctc 66 



<210> 16 
<211> 31 
<212> DNA 

<213> Primer 142779 
<400> 16

ttgaattgaa aatagattga tttaaaactt c 31 



<210> 17 

<211> 25 

<212> DNA 

<213> Primer 142780 

<400> 17 

ttgcatgcgt aatcatggtc atagc 25 

<210> 18
<211> 26 
<212> DNA 

<213> Primer 140288 
<400> 18 

ttgaattcat gggtaataac tgatat 26 

<210> 19 

<211> 32 

<212> DNA 

<213> Primer 142778 

<400> 19 

aaatcaatct attttcaatt caattcatca tt 32 



<210> 20 

<211> 45 

<212> DNA 

<213> Primer 141223 

<400> 20 

ggatgctgtt gactccggaa atttaacggt ttggtcttgc atccc 45